Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 678012 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 64.7 MiB |
| Average record size in memory | 100.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
Area is highly overall correlated with Density | High correlation |
BonusMalus is highly overall correlated with DrivAge | High correlation |
Density is highly overall correlated with Area | High correlation |
DrivAge is highly overall correlated with BonusMalus | High correlation |
IDpol has unique values | Unique |
VehAge has 57739 (8.5%) zeros | Zeros |
ClaimCount has 643952 (95.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-27 08:10:45.608374 |
|---|---|
| Analysis finished | 2025-08-27 08:11:03.106006 |
| Duration | 17.5 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
IDpol
Real number (ℝ)
Unique 
| Distinct | 678012 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2621860.6 |
| Minimum | 1 |
|---|---|
| Maximum | 6114330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 69365.55 |
| Q1 | 1157951.8 |
| median | 2272152.5 |
| Q3 | 4046274.2 |
| 95-th percentile | 6014195.3 |
| Maximum | 6114330 |
| Range | 6114329 |
| Interquartile range (IQR) | 2888322.5 |
Descriptive statistics
| Standard deviation | 1641781.1 |
|---|---|
| Coefficient of variation (CV) | 0.62618931 |
| Kurtosis | -0.65834502 |
| Mean | 2621860.6 |
| Median Absolute Deviation (MAD) | 1152062 |
| Skewness | 0.23788976 |
| Sum | 1.777653 × 1012 |
| Variance | 2.6954452 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 72150 | 1 | < 0.1% |
| 2124053 | 1 | < 0.1% |
| 1049168 | 1 | < 0.1% |
| 134313 | 1 | < 0.1% |
| 1145209 | 1 | < 0.1% |
| 2281532 | 1 | < 0.1% |
| 4122208 | 1 | < 0.1% |
| 4128877 | 1 | < 0.1% |
| 2102858 | 1 | < 0.1% |
| 1106637 | 1 | < 0.1% |
| Other values (678002) | 678002 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 13 | 1 | |
| 15 | 1 | |
| 17 | 1 | |
| 18 | 1 | |
| 21 | 1 |
| Value | Count | Frequency (%) |
| 6114330 | 1 | |
| 6114329 | 1 | |
| 6114328 | 1 | |
| 6114327 | 1 | |
| 6114326 | 1 | |
| 6114325 | 1 | |
| 6114324 | 1 | |
| 6114323 | 1 | |
| 6114322 | 1 | |
| 6114321 | 1 |
VehPower
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4546306 |
| Minimum | 4 |
|---|---|
| Maximum | 15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 15 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.0509071 |
|---|---|
| Coefficient of variation (CV) | 0.31774198 |
| Kurtosis | 1.6682028 |
| Mean | 6.4546306 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1713449 |
| Sum | 4376317 |
| Variance | 4.2062199 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 148976 | |
| 7 | 145400 | |
| 5 | 124821 | |
| 4 | 115349 | |
| 8 | 46956 | 6.9% |
| 10 | 31354 | 4.6% |
| 9 | 30085 | 4.4% |
| 11 | 18352 | 2.7% |
| 12 | 8214 | 1.2% |
| 13 | 3229 | 0.5% |
| Other values (2) | 5276 | 0.8% |
| Value | Count | Frequency (%) |
| 4 | 115349 | |
| 5 | 124821 | |
| 6 | 148976 | |
| 7 | 145400 | |
| 8 | 46956 | 6.9% |
| 9 | 30085 | 4.4% |
| 10 | 31354 | 4.6% |
| 11 | 18352 | 2.7% |
| 12 | 8214 | 1.2% |
| 13 | 3229 | 0.5% |
| Value | Count | Frequency (%) |
| 15 | 2926 | 0.4% |
| 14 | 2350 | 0.3% |
| 13 | 3229 | 0.5% |
| 12 | 8214 | 1.2% |
| 11 | 18352 | 2.7% |
| 10 | 31354 | 4.6% |
| 9 | 30085 | 4.4% |
| 8 | 46956 | 6.9% |
| 7 | 145400 | |
| 6 | 148976 |
VehAge
Real number (ℝ)
Zeros 
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0442603 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 57739 |
| Zeros (%) | 8.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 11 |
| 95-th percentile | 17 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.6662346 |
|---|---|
| Coefficient of variation (CV) | 0.8043761 |
| Kurtosis | 6.522051 |
| Mean | 7.0442603 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.1479909 |
| Sum | 4776093 |
| Variance | 32.106215 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 71284 | 10.5% |
| 2 | 59124 | 8.7% |
| 0 | 57739 | 8.5% |
| 3 | 50261 | 7.4% |
| 4 | 43492 | 6.4% |
| 5 | 38737 | 5.7% |
| 10 | 38394 | 5.7% |
| 6 | 35717 | 5.3% |
| 7 | 32880 | 4.8% |
| 8 | 32680 | 4.8% |
| Other values (68) | 217704 |
| Value | Count | Frequency (%) |
| 0 | 57739 | |
| 1 | 71284 | |
| 2 | 59124 | |
| 3 | 50261 | |
| 4 | 43492 | |
| 5 | 38737 | |
| 6 | 35717 | |
| 7 | 32880 | |
| 8 | 32680 | |
| 9 | 31922 |
| Value | Count | Frequency (%) |
| 100 | 25 | |
| 99 | 23 | |
| 85 | 1 | < 0.1% |
| 84 | 1 | < 0.1% |
| 83 | 2 | < 0.1% |
| 82 | 1 | < 0.1% |
| 81 | 3 | < 0.1% |
| 80 | 3 | < 0.1% |
| 79 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
DrivAge
Real number (ℝ)
High correlation 
| Distinct | 83 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.499153 |
| Minimum | 18 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 34 |
| median | 44 |
| Q3 | 55 |
| 95-th percentile | 72 |
| Maximum | 100 |
| Range | 82 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 14.13743 |
|---|---|
| Coefficient of variation (CV) | 0.31071854 |
| Kurtosis | -0.34268603 |
| Mean | 45.499153 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.43575894 |
| Sum | 30848972 |
| Variance | 199.86694 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36 | 17530 | 2.6% |
| 38 | 17346 | 2.6% |
| 39 | 17320 | 2.6% |
| 37 | 17295 | 2.6% |
| 52 | 17195 | 2.5% |
| 34 | 17059 | 2.5% |
| 40 | 17017 | 2.5% |
| 51 | 17016 | 2.5% |
| 41 | 16977 | 2.5% |
| 42 | 16953 | 2.5% |
| Other values (73) | 506304 |
| Value | Count | Frequency (%) |
| 18 | 748 | 0.1% |
| 19 | 2392 | 0.4% |
| 20 | 3676 | 0.5% |
| 21 | 4437 | 0.7% |
| 22 | 5291 | |
| 23 | 6261 | |
| 24 | 7392 | |
| 25 | 8697 | |
| 26 | 10301 | |
| 27 | 11827 |
| Value | Count | Frequency (%) |
| 100 | 3 | < 0.1% |
| 99 | 70 | |
| 98 | 5 | < 0.1% |
| 97 | 10 | < 0.1% |
| 96 | 15 | < 0.1% |
| 95 | 24 | < 0.1% |
| 94 | 32 | < 0.1% |
| 93 | 55 | |
| 92 | 66 | |
| 91 | 121 |
BonusMalus
Real number (ℝ)
High correlation 
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.761464 |
| Minimum | 50 |
|---|---|
| Maximum | 230 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 50 |
| median | 50 |
| Q3 | 64 |
| 95-th percentile | 95 |
| Maximum | 230 |
| Range | 180 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 15.636639 |
|---|---|
| Coefficient of variation (CV) | 0.26165087 |
| Kurtosis | 2.6748529 |
| Mean | 59.761464 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.7289437 |
| Sum | 40518990 |
| Variance | 244.50448 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 384156 | |
| 100 | 19530 | 2.9% |
| 68 | 18791 | 2.8% |
| 72 | 18580 | 2.7% |
| 76 | 18226 | 2.7% |
| 64 | 18192 | 2.7% |
| 80 | 18086 | 2.7% |
| 57 | 17938 | 2.6% |
| 60 | 17363 | 2.6% |
| 54 | 17360 | 2.6% |
| Other values (105) | 129790 | 19.1% |
| Value | Count | Frequency (%) |
| 50 | 384156 | |
| 51 | 15869 | 2.3% |
| 52 | 4770 | 0.7% |
| 53 | 3351 | 0.5% |
| 54 | 17360 | 2.6% |
| 55 | 5593 | 0.8% |
| 56 | 3453 | 0.5% |
| 57 | 17938 | 2.6% |
| 58 | 5970 | 0.9% |
| 59 | 2779 | 0.4% |
| Value | Count | Frequency (%) |
| 230 | 1 | < 0.1% |
| 228 | 1 | < 0.1% |
| 218 | 1 | < 0.1% |
| 208 | 1 | < 0.1% |
| 198 | 2 | < 0.1% |
| 196 | 3 | |
| 195 | 6 | |
| 190 | 3 | |
| 187 | 3 | |
| 185 | 5 |
VehBrand
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| B12 | |
|---|---|
| B1 | |
| B2 | |
| B3 | |
| B5 | |
| Other values (6) |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.3149517 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B2 |
|---|---|
| 2nd row | B12 |
| 3rd row | B1 |
| 4th row | B12 |
| 5th row | B1 |
Common Values
| Value | Count | Frequency (%) |
| B12 | 166024 | |
| B1 | 162736 | |
| B2 | 159861 | |
| B3 | 53394 | 7.9% |
| B5 | 34753 | 5.1% |
| B6 | 28548 | 4.2% |
| B4 | 25179 | 3.7% |
| B10 | 17707 | 2.6% |
| B11 | 13585 | 2.0% |
| B13 | 12178 | 1.8% |
Length
| Value | Count | Frequency (%) |
| b12 | 166024 | |
| b1 | 162736 | |
| b2 | 159861 | |
| b3 | 53394 | 7.9% |
| b5 | 34753 | 5.1% |
| b6 | 28548 | 4.2% |
| b4 | 25179 | 3.7% |
| b10 | 17707 | 2.6% |
| b11 | 13585 | 2.0% |
| b13 | 12178 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 678012 | |
| 1 | 389862 | |
| 2 | 325885 | |
| 3 | 65572 | 4.2% |
| 5 | 34753 | 2.2% |
| 4 | 29226 | 1.9% |
| 6 | 28548 | 1.8% |
| 0 | 17707 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1569565 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 678012 | |
| 1 | 389862 | |
| 2 | 325885 | |
| 3 | 65572 | 4.2% |
| 5 | 34753 | 2.2% |
| 4 | 29226 | 1.9% |
| 6 | 28548 | 1.8% |
| 0 | 17707 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1569565 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 678012 | |
| 1 | 389862 | |
| 2 | 325885 | |
| 3 | 65572 | 4.2% |
| 5 | 34753 | 2.2% |
| 4 | 29226 | 1.9% |
| 6 | 28548 | 1.8% |
| 0 | 17707 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1569565 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 678012 | |
| 1 | 389862 | |
| 2 | 325885 | |
| 3 | 65572 | 4.2% |
| 5 | 34753 | 2.2% |
| 4 | 29226 | 1.9% |
| 6 | 28548 | 1.8% |
| 0 | 17707 | 1.1% |
VehGas
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Regular | |
|---|---|
| Diesel |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.5101326 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Diesel |
|---|---|
| 2nd row | Regular |
| 3rd row | Regular |
| 4th row | Regular |
| 5th row | Diesel |
Common Values
| Value | Count | Frequency (%) |
| Regular | 345876 | |
| Diesel | 332136 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| regular | 345876 | |
| diesel | 332136 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1010148 | |
| l | 678012 | |
| R | 345876 | 7.8% |
| g | 345876 | 7.8% |
| u | 345876 | 7.8% |
| a | 345876 | 7.8% |
| r | 345876 | 7.8% |
| D | 332136 | 7.5% |
| i | 332136 | 7.5% |
| s | 332136 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4413948 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1010148 | |
| l | 678012 | |
| R | 345876 | 7.8% |
| g | 345876 | 7.8% |
| u | 345876 | 7.8% |
| a | 345876 | 7.8% |
| r | 345876 | 7.8% |
| D | 332136 | 7.5% |
| i | 332136 | 7.5% |
| s | 332136 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4413948 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1010148 | |
| l | 678012 | |
| R | 345876 | 7.8% |
| g | 345876 | 7.8% |
| u | 345876 | 7.8% |
| a | 345876 | 7.8% |
| r | 345876 | 7.8% |
| D | 332136 | 7.5% |
| i | 332136 | 7.5% |
| s | 332136 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4413948 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1010148 | |
| l | 678012 | |
| R | 345876 | 7.8% |
| g | 345876 | 7.8% |
| u | 345876 | 7.8% |
| a | 345876 | 7.8% |
| r | 345876 | 7.8% |
| D | 332136 | 7.5% |
| i | 332136 | 7.5% |
| s | 332136 | 7.5% |
Area
Categorical
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| C | |
|---|---|
| D | |
| E | |
| A | |
| B |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | D |
| 3rd row | E |
| 4th row | C |
| 5th row | E |
Common Values
| Value | Count | Frequency (%) |
| C | 191880 | |
| D | 151595 | |
| E | 137167 | |
| A | 103957 | |
| B | 75459 | 11.1% |
| F | 17954 | 2.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| c | 191880 | |
| d | 151595 | |
| e | 137167 | |
| a | 103957 | |
| b | 75459 | 11.1% |
| f | 17954 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 191880 | |
| D | 151595 | |
| E | 137167 | |
| A | 103957 | |
| B | 75459 | 11.1% |
| F | 17954 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 191880 | |
| D | 151595 | |
| E | 137167 | |
| A | 103957 | |
| B | 75459 | 11.1% |
| F | 17954 | 2.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 191880 | |
| D | 151595 | |
| E | 137167 | |
| A | 103957 | |
| B | 75459 | 11.1% |
| F | 17954 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 191880 | |
| D | 151595 | |
| E | 137167 | |
| A | 103957 | |
| B | 75459 | 11.1% |
| F | 17954 | 2.6% |
Density
Real number (ℝ)
High correlation 
| Distinct | 1607 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1792.4223 |
| Minimum | 1 |
|---|---|
| Maximum | 27000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 92 |
| median | 393 |
| Q3 | 1658 |
| 95-th percentile | 7313 |
| Maximum | 27000 |
| Range | 26999 |
| Interquartile range (IQR) | 1566 |
Descriptive statistics
| Standard deviation | 3958.6495 |
|---|---|
| Coefficient of variation (CV) | 2.2085473 |
| Kurtosis | 24.86941 |
| Mean | 1792.4223 |
| Median Absolute Deviation (MAD) | 355 |
| Skewness | 4.6514178 |
| Sum | 1.2152838 × 109 |
| Variance | 15670906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 27000 | 10515 | 1.6% |
| 3317 | 9891 | 1.5% |
| 1313 | 7157 | 1.1% |
| 9307 | 5986 | 0.9% |
| 3744 | 5540 | 0.8% |
| 1326 | 5447 | 0.8% |
| 405 | 5195 | 0.8% |
| 4128 | 5055 | 0.7% |
| 4762 | 4985 | 0.7% |
| 57 | 4262 | 0.6% |
| Other values (1597) | 613979 |
| Value | Count | Frequency (%) |
| 1 | 7 | < 0.1% |
| 2 | 92 | < 0.1% |
| 3 | 304 | < 0.1% |
| 4 | 274 | < 0.1% |
| 5 | 438 | 0.1% |
| 6 | 752 | 0.1% |
| 7 | 1088 | 0.2% |
| 8 | 1131 | 0.2% |
| 9 | 1813 | |
| 10 | 2911 |
| Value | Count | Frequency (%) |
| 27000 | 10515 | |
| 23396 | 66 | < 0.1% |
| 22821 | 182 | < 0.1% |
| 22669 | 463 | 0.1% |
| 21410 | 76 | < 0.1% |
| 20000 | 6 | < 0.1% |
| 18229 | 200 | < 0.1% |
| 17140 | 910 | 0.1% |
| 16533 | 613 | 0.1% |
| 16291 | 175 | < 0.1% |
Region
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.2 MiB |
| Centre | |
|---|---|
| Rhone-Alpes | |
| Provence-Alpes-Cotes-D'Azur | |
| Ile-de-France | |
| Bretagne | |
| Other values (16) |
Length
| Max length | 27 |
|---|---|
| Median length | 17 |
| Mean length | 12.962977 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Centre |
|---|---|
| 2nd row | Pays-de-la-Loire |
| 3rd row | Provence-Alpes-Cotes-D'Azur |
| 4th row | Pays-de-la-Loire |
| 5th row | Provence-Alpes-Cotes-D'Azur |
Common Values
| Value | Count | Frequency (%) |
| Centre | 160601 | |
| Rhone-Alpes | 84751 | |
| Provence-Alpes-Cotes-D'Azur | 79315 | |
| Ile-de-France | 69791 | |
| Bretagne | 42122 | 6.2% |
| Nord-Pas-de-Calais | 40275 | 5.9% |
| Pays-de-la-Loire | 38751 | 5.7% |
| Languedoc-Roussillon | 35805 | 5.3% |
| Aquitaine | 31329 | 4.6% |
| Poitou-Charentes | 19046 | 2.8% |
| Other values (11) | 76226 |
Length
| Value | Count | Frequency (%) |
| centre | 160601 | |
| rhone-alpes | 84751 | |
| provence-alpes-cotes-d'azur | 79315 | |
| ile-de-france | 69791 | |
| bretagne | 42122 | 6.2% |
| nord-pas-de-calais | 40275 | 5.9% |
| pays-de-la-loire | 38751 | 5.7% |
| languedoc-roussillon | 35805 | 5.3% |
| aquitaine | 31329 | 4.6% |
| poitou-charentes | 19046 | 2.8% |
| Other values (11) | 76226 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1462867 | |
| - | 795377 | 9.0% |
| n | 626133 | 7.1% |
| r | 598675 | 6.8% |
| o | 518984 | 5.9% |
| s | 503548 | 5.7% |
| a | 453346 | 5.2% |
| l | 386693 | 4.4% |
| t | 361569 | 4.1% |
| C | 308105 | 3.5% |
| Other values (24) | 2773757 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8789054 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1462867 | |
| - | 795377 | 9.0% |
| n | 626133 | 7.1% |
| r | 598675 | 6.8% |
| o | 518984 | 5.9% |
| s | 503548 | 5.7% |
| a | 453346 | 5.2% |
| l | 386693 | 4.4% |
| t | 361569 | 4.1% |
| C | 308105 | 3.5% |
| Other values (24) | 2773757 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8789054 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1462867 | |
| - | 795377 | 9.0% |
| n | 626133 | 7.1% |
| r | 598675 | 6.8% |
| o | 518984 | 5.9% |
| s | 503548 | 5.7% |
| a | 453346 | 5.2% |
| l | 386693 | 4.4% |
| t | 361569 | 4.1% |
| C | 308105 | 3.5% |
| Other values (24) | 2773757 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8789054 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1462867 | |
| - | 795377 | 9.0% |
| n | 626133 | 7.1% |
| r | 598675 | 6.8% |
| o | 518984 | 5.9% |
| s | 503548 | 5.7% |
| a | 453346 | 5.2% |
| l | 386693 | 4.4% |
| t | 361569 | 4.1% |
| C | 308105 | 3.5% |
| Other values (24) | 2773757 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 678012 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 136195 | |
| 5 | 135569 | |
| 4 | 135554 | |
| 2 | 135378 | |
| 3 | 135316 |
Exposure
Real number (ℝ)
| Distinct | 181 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.52874953 |
| Minimum | 0.00273224 |
|---|---|
| Maximum | 2.01 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.2 MiB |
Quantile statistics
| Minimum | 0.00273224 |
|---|---|
| 5-th percentile | 0.04 |
| Q1 | 0.18 |
| median | 0.49 |
| Q3 | 0.99 |
| 95-th percentile | 1 |
| Maximum | 2.01 |
| Range | 2.0072678 |
| Interquartile range (IQR) | 0.81 |
Descriptive statistics
| Standard deviation | 0.36444151 |
|---|---|
| Coefficient of variation (CV) | 0.68925169 |
| Kurtosis | -1.5242423 |
| Mean | 0.52874953 |
| Median Absolute Deviation (MAD) | 0.37 |
| Skewness | 0.08532088 |
| Sum | 358498.53 |
| Variance | 0.13281761 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 168125 | |
| 0.08 | 44670 | 6.6% |
| 0.07 | 12969 | 1.9% |
| 0.24 | 12950 | 1.9% |
| 0.5 | 12497 | 1.8% |
| 0.49 | 12298 | 1.8% |
| 0.03 | 11996 | 1.8% |
| 0.04 | 11131 | 1.6% |
| 0.12 | 11047 | 1.6% |
| 0.2 | 8727 | 1.3% |
| Other values (171) | 371602 |
| Value | Count | Frequency (%) |
| 0.00273224 | 1060 | 0.2% |
| 0.002739726 | 2045 | 0.3% |
| 0.005464481 | 609 | 0.1% |
| 0.005479452 | 1396 | 0.2% |
| 0.008196721 | 620 | 0.1% |
| 0.008219178 | 1147 | 0.2% |
| 0.01 | 6726 | |
| 0.02 | 5656 | |
| 0.03 | 11996 | |
| 0.04 | 11131 |
| Value | Count | Frequency (%) |
| 2.01 | 2 | |
| 2 | 1 | |
| 1.99 | 1 | |
| 1.98 | 1 | |
| 1.93 | 1 | |
| 1.92 | 1 | |
| 1.9 | 2 | |
| 1.88 | 1 | |
| 1.85 | 2 | |
| 1.82 | 1 |
ClaimCount
Real number (ℝ)
Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.052447449 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 643952 |
| Zeros (%) | 95.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.23480476 |
|---|---|
| Coefficient of variation (CV) | 4.476953 |
| Kurtosis | 73.267147 |
| Mean | 0.052447449 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.4175084 |
| Sum | 35560 |
| Variance | 0.055133278 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 643952 | |
| 1 | 32687 | 4.8% |
| 2 | 1298 | 0.2% |
| 3 | 62 | < 0.1% |
| 4 | 5 | < 0.1% |
| 11 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 643952 | |
| 1 | 32687 | 4.8% |
| 2 | 1298 | 0.2% |
| 3 | 62 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 5 | < 0.1% |
| 3 | 62 | < 0.1% |
| 2 | 1298 | 0.2% |
| 1 | 32687 |
Interactions
Correlations
| Area | BonusMalus | ClaimCount | Density | DrivAge | Exposure | Group | IDpol | Region | VehAge | VehBrand | VehGas | VehPower | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Area | 1.000 | 0.051 | 0.006 | 0.589 | 0.029 | 0.058 | 0.000 | 0.056 | 0.318 | 0.040 | 0.073 | 0.131 | 0.038 |
| BonusMalus | 0.051 | 1.000 | 0.038 | 0.139 | -0.571 | -0.196 | 0.000 | -0.011 | 0.029 | 0.082 | 0.021 | 0.048 | -0.068 |
| ClaimCount | 0.006 | 0.038 | 1.000 | 0.013 | 0.012 | 0.071 | 0.000 | -0.141 | 0.005 | -0.022 | 0.000 | 0.002 | -0.003 |
| Density | 0.589 | 0.139 | 0.013 | 1.000 | -0.044 | -0.123 | 0.001 | 0.062 | 0.235 | -0.102 | 0.049 | 0.102 | -0.012 |
| DrivAge | 0.029 | -0.571 | 0.012 | -0.044 | 1.000 | 0.164 | 0.000 | 0.058 | 0.041 | -0.078 | 0.051 | 0.119 | 0.040 |
| Exposure | 0.058 | -0.196 | 0.071 | -0.123 | 0.164 | 1.000 | 0.000 | -0.157 | 0.091 | 0.184 | 0.091 | 0.040 | -0.036 |
| Group | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 | 1.000 | 0.000 | 0.001 | 0.002 | 0.000 | 0.000 | 0.000 |
| IDpol | 0.056 | -0.011 | -0.141 | 0.062 | 0.058 | -0.157 | 0.000 | 1.000 | 0.138 | -0.120 | 0.230 | 0.050 | 0.032 |
| Region | 0.318 | 0.029 | 0.005 | 0.235 | 0.041 | 0.091 | 0.001 | 0.138 | 1.000 | 0.066 | 0.130 | 0.087 | 0.045 |
| VehAge | 0.040 | 0.082 | -0.022 | -0.102 | -0.078 | 0.184 | 0.002 | -0.120 | 0.066 | 1.000 | 0.110 | 0.127 | -0.002 |
| VehBrand | 0.073 | 0.021 | 0.000 | 0.049 | 0.051 | 0.091 | 0.000 | 0.230 | 0.130 | 0.110 | 1.000 | 0.116 | 0.154 |
| VehGas | 0.131 | 0.048 | 0.002 | 0.102 | 0.119 | 0.040 | 0.000 | 0.050 | 0.087 | 0.127 | 0.116 | 1.000 | 0.280 |
| VehPower | 0.038 | -0.068 | -0.003 | -0.012 | 0.040 | -0.036 | 0.000 | 0.032 | 0.045 | -0.002 | 0.154 | 0.280 | 1.000 |
Missing values
Sample
| IDpol | VehPower | VehAge | DrivAge | BonusMalus | VehBrand | VehGas | Area | Density | Region | Group | Exposure | ClaimCount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2124053 | 5.0 | 1.0 | 31.0 | 60.0 | B2 | Diesel | C | 393.0 | Centre | 2 | 0.53 | 0 |
| 1 | 1049168 | 4.0 | 2.0 | 73.0 | 50.0 | B12 | Regular | D | 983.0 | Pays-de-la-Loire | 3 | 0.10 | 0 |
| 2 | 134313 | 4.0 | 11.0 | 60.0 | 62.0 | B1 | Regular | E | 3744.0 | Provence-Alpes-Cotes-D'Azur | 1 | 1.00 | 0 |
| 3 | 1145209 | 7.0 | 9.0 | 37.0 | 50.0 | B12 | Regular | C | 204.0 | Pays-de-la-Loire | 1 | 0.06 | 0 |
| 4 | 2281532 | 5.0 | 4.0 | 43.0 | 54.0 | B1 | Diesel | E | 3317.0 | Provence-Alpes-Cotes-D'Azur | 3 | 0.50 | 0 |
| 5 | 4122208 | 7.0 | 15.0 | 74.0 | 50.0 | B1 | Regular | A | 45.0 | Centre | 1 | 1.00 | 0 |
| 6 | 4128877 | 4.0 | 8.0 | 28.0 | 76.0 | B2 | Regular | E | 3688.0 | Rhone-Alpes | 4 | 0.50 | 0 |
| 7 | 2102858 | 7.0 | 14.0 | 20.0 | 100.0 | B1 | Regular | D | 1329.0 | Ile-de-France | 5 | 0.14 | 0 |
| 8 | 1106637 | 6.0 | 0.0 | 53.0 | 58.0 | B1 | Diesel | C | 433.0 | Provence-Alpes-Cotes-D'Azur | 4 | 0.78 | 0 |
| 9 | 3166307 | 5.0 | 17.0 | 40.0 | 83.0 | B2 | Regular | A | 10.0 | Auvergne | 2 | 0.43 | 0 |
| IDpol | VehPower | VehAge | DrivAge | BonusMalus | VehBrand | VehGas | Area | Density | Region | Group | Exposure | ClaimCount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 678002 | 4120049 | 6.0 | 6.0 | 70.0 | 64.0 | B2 | Diesel | C | 229.0 | Bretagne | 3 | 1.00 | 0 |
| 678003 | 2107318 | 7.0 | 2.0 | 24.0 | 95.0 | B13 | Regular | C | 226.0 | Centre | 4 | 0.04 | 0 |
| 678004 | 4020583 | 8.0 | 8.0 | 27.0 | 85.0 | B12 | Regular | F | 17140.0 | Ile-de-France | 2 | 0.12 | 0 |
| 678005 | 1034643 | 5.0 | 11.0 | 58.0 | 80.0 | B2 | Regular | C | 280.0 | Centre | 5 | 1.00 | 0 |
| 678006 | 5036862 | 7.0 | 5.0 | 50.0 | 50.0 | B2 | Diesel | C | 267.0 | Ile-de-France | 3 | 0.72 | 0 |
| 678007 | 4134506 | 6.0 | 4.0 | 61.0 | 50.0 | B2 | Diesel | C | 220.0 | Rhone-Alpes | 4 | 1.00 | 0 |
| 678008 | 1037983 | 8.0 | 11.0 | 36.0 | 72.0 | B10 | Diesel | C | 282.0 | Centre | 3 | 0.04 | 0 |
| 678009 | 3197389 | 7.0 | 11.0 | 50.0 | 50.0 | B2 | Diesel | A | 9.0 | Centre | 4 | 1.00 | 0 |
| 678010 | 25934 | 7.0 | 20.0 | 34.0 | 52.0 | B1 | Diesel | C | 176.0 | Provence-Alpes-Cotes-D'Azur | 5 | 1.00 | 0 |
| 678011 | 72150 | 7.0 | 13.0 | 33.0 | 50.0 | B2 | Regular | C | 115.0 | Pays-de-la-Loire | 4 | 1.00 | 0 |